Picture for Wilson Y. Lee

Wilson Y. Lee

How Many Human Judgments Are Enough? Feasibility Limits of Human Preference Evaluation

Add code
Jan 15, 2026
Viaarxiv icon

Lessons from the Trenches on Reproducible Evaluation of Language Models

Add code
May 23, 2024
Figure 1 for Lessons from the Trenches on Reproducible Evaluation of Language Models
Figure 2 for Lessons from the Trenches on Reproducible Evaluation of Language Models
Figure 3 for Lessons from the Trenches on Reproducible Evaluation of Language Models
Figure 4 for Lessons from the Trenches on Reproducible Evaluation of Language Models
Viaarxiv icon

BLOOM: A 176B-Parameter Open-Access Multilingual Language Model

Add code
Nov 09, 2022
Viaarxiv icon

Evolving Label Usage within Generation Z when Self-Describing Sexual Orientation

Add code
Aug 29, 2022
Figure 1 for Evolving Label Usage within Generation Z when Self-Describing Sexual Orientation
Figure 2 for Evolving Label Usage within Generation Z when Self-Describing Sexual Orientation
Figure 3 for Evolving Label Usage within Generation Z when Self-Describing Sexual Orientation
Figure 4 for Evolving Label Usage within Generation Z when Self-Describing Sexual Orientation
Viaarxiv icon

Between words and characters: A Brief History of Open-Vocabulary Modeling and Tokenization in NLP

Add code
Dec 20, 2021
Figure 1 for Between words and characters: A Brief History of Open-Vocabulary Modeling and Tokenization in NLP
Viaarxiv icon